Parallel Blob Extraction Using the Multi-core Cell Processor
نویسندگان
چکیده
The rapid increase in pixel density and frame rates of modern imaging sensors is accelerating the demand for fine-grained and embedded parallelization strategies to achieve real-time implementations for video analysis. The IBM Cell Broadband Engine (BE) processor has an appealing multi-core chip architecture with multiple programming models suitable for accelerating multimedia and vector processing applications. This paper describes two parallel algorithms for blob extraction in video sequences: binary morphological operations and connected components labeling (CCL), both optimized for the Cell-BE processor. Novel parallelization and explicit instruction level optimization techniques are described for fully exploiting the computational capacity of the Synergistic Processing Elements (SPEs) on the Cell processor. Experimental results show significant speedups ranging from a factor of nearly 300 for binary morphology to a factor of 8 for CCL in comparison to equivalent sequential implementations applied to High Definition (HD) video.
منابع مشابه
Ultra-Low-Energy DSP Processor Design for Many-Core Parallel Applications
Background and Objectives: Digital signal processors are widely used in energy constrained applications in which battery lifetime is a critical concern. Accordingly, designing ultra-low-energy processors is a major concern. In this work and in the first step, we propose a sub-threshold DSP processor. Methods: As our baseline architecture, we use a modified version of an existing ultra-low-power...
متن کاملParallel implementation of the wideband DOA algorithm on single core, multicore, GPU and IBM cell BE processor
The Multiple Signal Classification (MUSIC) algorithm is a powerful technique for determining the Direction of Arrival (DOA) of signals impinging on an antenna array.The algorithm is serial based, mathematically intensive, and requires substantial computing power to realize in real-time.Recently, multi-core processors are becoming more prevalent and affordable.The challenge of adapting existing ...
متن کاملAccelerating Video Feature Extractions in CBVIR on Multi-core Systems
With the explosive increase in video data, automatic video management (search/retrieval) is becoming a mass market application, and Content-Based Video Information Retrieval (CBVIR) is one of the best solutions. Most CBVIR systems are based on low-level feature extractions guided by the MPEG-7 standard for high-level semantic concept indexing. It is well known that CBVIR is a very compute-inten...
متن کاملColor Image Segmentation using Median Cut and Contourlet Transform : A Parallel Segmentation Approach
This paper presents a parallel implementation of color image segmentation algorithm using multiresolution technique. The idea is to achieve the complete and significant objects in the image using contourlet transform based image segmentation and to explore current multi-core architectures present in commercial processors in order to speed up the segmentation process for large size images. The a...
متن کاملAcceleration of Correlation Matrix on Heterogeneous Multi-Core CELL-BE Platform
A parallel implementation of the highly useful and computational intensive correlation matrix arithmetic is presented here. A heterogeneous multi-core CELL-BE processor platform has been used for the purpose of accelerating the performance of the kernel. An extensive measures of the performance reflects the massive capability of the CELL-BE processor architecture. In comparison with a Power Pro...
متن کامل